Presentation

Introduction to Subject

R. intestinalis survives post-glucose (and post-mucin-degradation products) depletion by entering a slow-growth mode.

A combination of targeted metabolomics and RNA-seq showed that the slow growth mode of R. intestinalis represents a diauxic shift towards acetate and lactate consumption (both leading to butyrate production).

R. intestinalis consistently increased the expression of genes involved in glycolysis, pyruvate utilization, mucin hydrolysis, galactose and mannose utilization, and the butyrate-producing process after depletion of glucose/pyruvate.

Data overview

  • Fermentation

    • containing 888 experiments with data:
    • Exp, Replicate, Mucin, Species.combi, T0h, T4h, T6h, T8h, T10h, T12h, T14h, T16h, T20h, T24h, T28h, T32h, T36h, T38h, T40h, T42h, T44h, T48h, T56h, T60h, T72h, T86h, T96h, T120h, T144h, T156h, T168h, T192h, T216h, T240h, T264h, T288h
  • RNA expression data

    • containing recorded expression of 4262 genes in 45 experiments
  • Annotation for expression dataset

    • Coupling internal gene ID’s to gene names , KO ID’s, and COG ID’s
  • Meta-data for fermentation dataset

    • Coupling RNA expression samples with fermentation batches and time points

Data Cleaning and Augmentation

Removing empty time-points from fermentation dataset

ri_fermentation_data |>
  filter(Exp == "RI_batch_WC1") |>
  select( "Rowname", (starts_with("T") & ends_with("h")) ) |>
  select(where(~!all(is.na(.x)) ) ) |>
  write_csv("../data/02_fermentation_data_clean.csv")

Subsetting RNA expression to wild type R. intestinalis only

ri_expression_data |> 
  pivot_longer(cols = -X,
               
               names_to = "sample",
               values_to = "expression") |> 
  inner_join(ri_metadata, by = c("sample" = "GC.code" )) |> 
  filter(Exp == "RI_batch_WC") |> 
  select(X,expression,sample) |> 
  pivot_wider(names_from = sample,
              values_from = expression) |> 
  write_csv("../data/02_data_clean.csv")

Analysis Results: Co-Expression

  • Input data was a total of 9 samples across 3 time conditions (with 3 replicates each)
  • Matrix reveals co-expression patterns in R. intestinalis gene expression data
  • Co-regulated genes can potentially contribute to the same pathways

Analysis Results: Clustering

  • K-means clustering was applied to the correlation matrix from the previous slide
  • There is a clear separation of genes in the 2D space
  • Genes are grouped into 3 clusters (arbitrary cutoff for regulation validation)

Top KO Codes per Cluster

  • Annotation of R. intestinalis genome enables matching of Gene ids with KO terms
  • Mapping of KO terms to pathways enables the calculation of pathways percentages across clusters
  • As expected, some pathways are represented across all clusters, whereas others are limited to 1 or 2 clusters.

Differential Expression

Expression

Top Counts

## Fermentation

Full Fermentation

Conclusion